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Abstract 

We derive a Maximum Principle for optimal control problems with constraints given by the cou¬ 
pling of a system of ODEs and a PDE of Vlasov-type. Such problems arise naturally as P-limits of 
optimal control problems subject to ODE constraints, modeling, for instance, external interventions 
on crowd dynamics. We obtain these first-order optimality conditions in the form of Hamiltonian 
flows in the Wasserstein space of probability measures with forward-backward boundary conditions 
with respect to the first and second marginals, respectively. In particular, we recover the equations 
and their solutions by means of a constructive procedure, which can be seen as the mean-field limit 
of the Pontryagin Maximum Principle applied to the discrete optimal control problems, under a 
suitable scaling of the adjoint variables. 

Keywords: Sparse optimal control, mean-field limit, F-limit, optimal control with ODE-PDE 
constraints, subdifferential calculus, Hamiltonian flows. 


1 Introduction 

The study of large crowds of interacting agents has received a significantly growing attention in the 
mathematical literature of the last decade with applications in biology, ecology, social sciences, and eco¬ 
nomics. Starting from the seminal papers [25, 27, 37, 39], emphasis has been put on self-organization, 
i.e., the formation of macroscopic patterns from the superimposition of simple, reiterated, binary in¬ 
teractions. A quintessential situation is the convergence of a crowd to a common state, which may be 
called consensus, agreement, or rendezvous. Several examples show that spontaneous convergence to 
pattern formation is not always guaranteed, e.g., for highly dispersed initial configurations in consensus 
problems [17, 18, 21, 30], hence, the issue of controlling and stabilizing these systems arises naturally. 
Two major interpretations of control of multiagent systems have received much attention; on the one 
hand, with the decentralized approach, the problem is recast into a game-theoretic framework, where 
agents optimize their individual cost and solutions correspond to Nash equilibria. On the other hand, 
following the concept of centralized intervention, an external policy-maker controlling the dynamics is 
introduced, with the task of minimizing its intervention cost. 
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When dealing with large populations, in both cases one faces the well-known problem of the curse 
of dimensionality, term first coined by Bellman precisely in the context of dynamic optimization: the 
complexity of numerical computations of the solutions of the above problems blows up as the size of 
the population increases. A possible way out is the so-called mean-field approach, where the individual 
influence of the entire population on the dynamics of a single agent is replaced by an averaged one. 
This substitution principle results in a unique mean-field equation and allows the computation of 
solutions, cutting loose from the dimensionality. 

In the game-theoretic setting, the mean-field approach has led to the development of mean-field 
games [26, 29], which model populations whose agents are competing freely with the others towards the 
maximization of their individual payoff, as for instance in the financial market. The landmark feature 
of such systems is their capability to autonomously stabilize without external intervention. However, 
in reality, societies exhibit either convergence to undesired patterns or tendencies toward instability 
that only an external government can successfully dominate. The need of such interventions, together 
with the limited amount of resources that governments have at their disposal, makes the design of 
parsimonious stabilization strategies a key issue, which has been extensively studied in the context of 
dynamics given by systems of ODEs, see [7, 8, 9, 10, 13]. 

Nevertheless, the concept of sparse control has to be handled with care when trying to generalize 
it at the level of a mean-field dynamics. Indeed, the indistinguishability of agents is a fundamental 
property of the mean-field setting, and it is in sharp contrast with controls acting sparsely on specific 
agents. Figuratively, trying to stabilize a huge crowds with these controls is like steering a river by 
means of toothpicks! A first solution to this ambiguity was given in [6, 23], where the control is defined 
as a locally Lipschitz feedback control with respect to the state variables, and sparsity refers to its 
property of having a small support. Such concept was successfully used in [33] to implement sparse 
stabilizers for a consensus problem. This interpretation of sparsity appears also in the framework of 
the control of more classical PDEs, see [16, 34, 35, 38]. An alternative solution for a proper definition 
of sparse mean-field control was proposed in [22], where the control is sparsely applied on a finite 
number of individuals immersed in the mean-field dynamics of the rest of the population, resulting in 
a system where the controlled ODEs are coupled with a control-free mean-field PDE (but indirectly 
controlled via the coupling). The same kind of control was considered in [1] to model the efficient 
evacuation of a large crowd of pedestrians with the help of very few informed agents. 

While in the context of mean-field games and optimal control problems with PDE constraints, 
first-order optimality conditions have received enormous attention, see for instance [5, 11, 14, 36], up 
to now no corresponding results have appeared in the literature for coupled ODE-PDE systems of the 
kind considered in [22], to the best of our knowledge. This paper is devoted to the development of 
a Pontryagin Maximum Principle to characterize optima of such control problems. We first remark 
that we are not interested in all possible optima, but mainly on those which arise as limits of optimal 
strategies of the original discrete problems. We call this subclass of the set of optima mean-field 
optimal controls (see Definition 1.4). The interest in this class complies with the wish of using the 
continuous models as approximations of the finite-dimensional ones. Furthermore in the model cases 
considered in [22, 23], it is exactly the existence of mean-field optimal controls that is proved. 

We summarize our result, borrowing a leaf from the diagram in [14], as follows: 
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We shall provide a set of hypotheses for which the dashed line from the upper-right to the bottom- 
right box is valid. Our strategy shall be the following; we apply the Pontryagin Maximum Principle 
(see e.g. [15, Theorem 23.11]) to the finite-dimensional optimal control problems (the solid line from 
the upper-left to the bottom-left box), and we pass to the mean-field limit the system of equations 
obtained with this procedure (the solid line from the bottom-left to the bottom-right box). The 
derived limit equation for the state and the (rescaled) adjoint variables are obtained in the form of 
Hamiltonian flows in the Wasserstein space of probability measures, in the sense of [3]. The result 
will be a first-order condition valid for all mean-field optimal controls. The existence of such controls 
is also proved (see Corollary 2.15). 

More formally, we are interested in deriving optimality conditions for the solutions of the following 
optimal control problem subject to coupled ODE-PDE constraints. 


Problem 1. For T > 0 fixed, find u* G L^([0,T];ZY) minimizing the cost functional 

F{u)= [ [L{y{t),ir{t))+j{u{t))]dt, (1.1) 

Jo 

where (y, y) solve 

ivkit) = {K * p,{t)){yk{t)) + fk{y{t)) + Bku{t), k = l,...,m, 

\dty(t) = -Va; • [{K -k y(t) + g{y{t)))y{t)] , 

for the given initial datum (y( 0 ),^( 0 )) = {y^,yP) G x Pc(IK'^)- 


Here, 7 is a strictly convex cost functional, the finite dimensional set of controls U is convex and 
compact, Bk are constant matrices, and is the set of probability measures on with compact 

support. 

We shall prove the following main result. 

Theorem 1.1. Fix an initial datum {y^,y^) G x 'Pc(I^^) o-nd assume that Hypotheses (H) in 
Section 1.1 hold. Then there exists a mean-field optimal control for Problem 1. Furthermore, if 
u* is a mean-field optimal control for Problem 1 and {y*,fi*) is the corresponding trajectory, then 
{u*,y*, y*) satisfies the following extended Pontryagin Maximnm Principle.' 
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There exists {q* {■), i^* {■)) G Lip([0, T]; x 'Pi(R^'^)) such that 

• there exists Rt > 0, depending only on ,supp{iJ,^), d, K, g, and T, such that 

supp(i/*(-)) C B{0,Rt) and it satisfies v*{t){E x R'^) = g,*{t){E) for all t G [0,T] and for 
every Borel set E C R'^; 


it holds 


ijl =VqMy*,q*,iy*,u*), 

<ik = 

dtu* = -V■ {{JVu^c{y\q\yfiu*))v*). 
u* = arg max^gw ^c{y* ^q* ,u) 


(1.3) 


where J G ]^ 2 dx 2 d symplectic matrix 


J = 


0 Id 


the Hamiltonian 


-Id 0 

^ 2 dm ^ X R^ ^ R is defined as 


-Xy,q,^,u) = 


EI( 2 /, q, 12 , u) if supp(z^) C B{0, Rt), 


+00 elsewhere; 

and H : x x R zs defined as 

'E.{y,q,u,u) =- f {r — r') ■ K{x — x') du{x,r) dh'{x',r') + [ r ■ g{y){x)du{x,r) 

2 jTBid .ht^d 


+ EL qk ■ K{yk - x)du{x,r) + • {fk{y) + Bku) - L{y,T:i#u) - -;{u). 

(1.4) 


k=l' 


k=l 


• the following conditions for system (1.3) hold at time 0.’ y*{0) = y^ and 12*{0)(E x 
yP{E) for every Borel set E , 

• the following conditions for system (1.3) hold at time T: q*{T) = 0 and u*{T){'Sfi x E) = 
So{E) for every Borel set E , where do is the Dirac measure centered in 0. 


As already mentioned, the formulation given above shows that the dynamics of {y*,q*, 12 *) is 
essentially an Hamiltonian flow in the Wasserstein space of probability measures with respect to state 
and adjoint variables with Hamiltonian H, in the sense of [3]. The definition of Etc is introduced to 
simplify some technical details and does not alter the result. This fact is remarkably consistent with 
the dynamics (1.2), since both are flows in a Wasserstein space. We believe that this formulation of 
the optimality conditions making use of the formalism of subdifferential calculus in Wasserstein spaces 
of probability measures constitutes one of the novelties of the work. 
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Remark 1.2. For every {y,q,i') with supp(j^) C B{0,Rx), (1-4) immediately implies that 


G argmax • BkU - 7(1 

'itC-lJ 1 * ^ 


u G argmaxBIc(y, q, u, u) 
u&U 


Then, the strict convexity of 7 and the convexity and the compactness of U imply that u is 
uniquely determined by {y, q, v). This is the reason why we write the equality symbol in u* = 
argmax^g^ BIc(y*) q* ■, in place of an inclusion. 

We point out the difference between the usual gradient in with respect to the state variables 
X and the adjoint variables r, denoted by and the Wasserstein gradient of He, which, as 

shown in Section 4, whenever v has supported contained in B{Q,Rt) can be computed explicitly as 
follows; 

• For / = 1,..., d, it holds 

VJS.c{y, q, u){x, r) - 61 = (r - r') ■ {T>K{x - x')ei) du{x', r') + r ■ {'D^g{y){x)ei) 

JRZ'* 

m 

-^Qk- (DK{yk - x)ei) - V^£(y,x, / ujy) ■ e; 
k=l 

- (y‘;^{y,x,jujy)T)uj{x)) -ei. 

These are the components of V^Ilc{y,q,u,u){x,r) in the x; coordinates. 

• For / = d + 1,..., 2d it holds 

VMy,q,i^,u){x,r)-ei= [ K^x - x') ■ ei.ddu{x',r') + giy){x) ■ ei-d- (1.6) 


These are the components of V;yHc(y, g, i^, u)(x, r) in the ri-d coordinates. 


Smd. Tn>d\ 


are related to the 


In (1.5) and (1.6), the functions i G x x M'^;M) and u G C^(M‘^;M^) are related to the 

functional L in (1.1) via 

L{y,y)= i{y,x,Ju}fi) dg{x), 

jRd 

where Jujg := w/r(M'^), while and denote the partial derivatives of the function 

and Da;(x) is the Jacobian of the function w evaluated at x. Notice that V,yM{y,q,i',u) actually 
does not depend on u, as a consequence of the fact that the control does not act directly on the PDF 
component of ( 1 . 2 ). 

The main tool we use to prove Theorem 1.1 is the Pontryagin Maximum Principle (henceforth, 
simply addressed as PMP) for optimal control problems with ODE constraint. We shall apply it to the 
following finite-dimensional problems, whose constraints converge to the coupled ODE-PDE system 
of Problem 1, as we will show in Section 2. For this reason, we call Theorem 1.1 the extended PMP. 


Problem 2. For T > 0 fixed, find u* G L^{[0,T];U) minimizing the cost functional 


Fn{u) = / [L{y{t),gN{t)) + 7(«(i))] dt, 

Jo 
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where (y,/iAr) solve 


yk = jj Yl^=iK{yk - Xj) + fk{y) +BkU, k = l,...,m 
Xi = TrT,f=iK{xi - Xj) + g{y){xi), i = 


for the given initial datum (?/(0),x(0)) = {y^,x^) G x where 




is the empirical measure centered on the trajectory x{-) = (a;i(-),... ,xn{-))- 


The extended PMP will be derived after reformulating the finite-dimensional PMP applied to 
Problem 2 in terms of the empirical measure in the product space of state variables Xi and adjoint 
variables pi, defined as 

1 ^ 

UN{x,r) = — '^d{x - Xi,r - Npi). 

i=l 

Notice that rescaling the adjoint variables pi by the number N of agents is needed in order to observe 
a nontrivial dynamics in the limit (see also Remark 3.7); indeed, within this scaling, the right-hand 
side of the finite-dimensional PMP is brought back to the form considered, for instance, in [19], with 
a different Hamiltonian. 

The following diagram recollects the strategy of the proof, making use of the notation already 
introduced and reporting in which part of the paper each result is proved: 



The structure of the paper is the following. In Section 1.1 we recall notations and the main 
Hypotheses (H). In Section 2, we study the controlled dynamics subject to a coupled ODE-PDE 
constraint of the form (1.2), establishing existence and uniqueness results for solutions. In Section 3 
we study the finite-dimensional Problem 2, and apply the PMP to it. In Section 4, we recall basic 
facts about sub differential calculus in Wasserstein spaces, and we explicitly compute In Section 

5, we prove the extended PMP, i.e.. Theorem 1.1. Finally, Section 6 is devoted to the study of an 
interesting example of Problem 1, the Cucker-Smale system. 


1.1 Notation and Hypotheses (H) 

We start this section by recalling the notation used throughout the paper. 
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The constants d, D are two positive integers (the dimension of the space of the agents and of the 
control, respectively), T > 0 (the end time of the optimization procedure), and U is a convex compact 
subset of (set in which controls take values). 

Functionals have the following expressions; K : —)■ each satisfies —)• and 

for every y G and y. G Pi(M‘^), g{y) : —)■ and L{y,y) : —)■ M. The matrices are 

constant d x D matrices. 

The space P(M”) is the set of probability measures which take values on M”, while the space^ 
Pp(M”') is the subset of 'P(M"') whose elements have finite p-th moment, i.e.. 



x\\^dy{x) < + 00 . 


We denote by Vc{^^) the subset of Pi(M"') which consists of all probability measures with compact 
support. Notice that, if (^n)neN is a sequence in and it exists R > 0 such that supp(/Un) ^ 

B{0,R) for all n G N, then (^n)neN is compact in Pp(R"') for all p > 1. 

For any y G P(M"') and any Borel function r : M”! —)■ , we denote by r^yy G V{MR^) the 

push-forward of y through r , defined by 

r^y{B) := y{r~^{B)) for every Borel set B of 

In particular, if one considers the projection operators tti and 7r2 defined on the product space 
M""! X , for every p G P(R"'i x R"'^) we call first (resp., second) marginal of p the probability 
measure vri^p (resp., tt 2 #p)- Given y G P(R"'i) and u G 'P(R"'2), we denote with T{y,i') the subset 
of all probability measures in 'P(R"'i x R"'^^ with first marginal y and second marginal v. 

On the set Vp{W^) we shall consider the following distance, called the IFassersfefn or Monge- 
Kantorovich-Rubinstein distance, 


= inf 


k - y\ddp{x,y) : p G T{y,v) 


If p = 1 we have the following equivalent expression for the Wasserstein distance: 


(1.9) 


yVi{y, v) = sup 


Lp{x)d{y — i'){x) : (p G Lip(R"'), Lip((p) < 1 


We denote by To{y, n) the set of optimal plans for which the minimum is attained, i.e., 

p^To{y,v) p G F(p,z^) and / \\x - y\fdp{x,y) = Wl{y,v). 

It is well-known that Toiy,^) is non-empty for every {y,v) G 'Pp(R"') x 'Pp(R"'), hence the infimum 
in (1.9) is actually a minimum. For more details, see e.g. [40, 4]. 

For any y ^ Vi (R*^) and K : —)■ R*^, the notation K x y stands for the convolution of K and 

y, i.e.. 


{K-k y){x) = f K{x — x')dy{x'); 

Jmd 

this quantity is well-defined whenever K is continuous and sublinear, i.e., there exists C such 
that ||iF(.^)|| < C{1 -|- ||.^||) for all ^ G R'^. Furthermore we shall deal also with the convolution 
(V(j,/,.q(r', iF(x'))) -ku in R^*^, whose explicit expression is 

{{"^{x',r’){r',K{x')))-ku) {x,r) = [ - r',K{x - x'))) dv{x',r'). 

^We follow the notation of [4]. 
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Notice that, under the hypotheses we are going to make, this convolution is not always well-defined 
for V G 7^1 It is nonetheless well-defined for measures v G , that is to say for all the 

cases that will appear in the sequel. 

We shall denote with space of bounded Radon vector measures from R""! to 

R”^, and with || • ||;v((,(k"i;K" 2 ) the total variation norm on it. If w G C(R'^;R'^) is sublinear and 
fi G Pi(R'^), the Radon measure ujfj, G A^fe(R'^;R'^) is defined as 

uj^{E) := / (jj{x)d^{x), for every £' C R*^ bounded. 

Je 

We shall denote with f ujfj, := W|u(R'^). 

In what follows, we shall consider the space X := R'^’” x 'Pi(R'^), together with the following 
distance 

where \\y - y'\\ := YIk=i hk - y'k\\i 2 m • 

Finally, for every G N, the mapping IItv : R^*^^ —)• 'Pi(R^'^) is defined as follows 

1 ^ 

Rat : (xi,pi, . . . ,XAr,PAr) -Xi,- - Npi). 

2=1 

Henceforth, we assume that the following regularity properties hold. 

Hypotheses (H) 

(K) The function K G C^(R'^;R‘^) is odd and sublinear, i.e., there exists Ck > 0 such that for 
all X G R'^ it holds 

||iF(x)|| <C;,(l + ||x||). 

(L) The function L : R'^’” x 'Pi(R‘^) —)■ R is 

L{y,p)= [ £{y,xjup)dy{x), 

jRd 

with i G xR^x R'^; R) and u G R'^). 

(G) The function g G C^(R'^™;C^(R'^; R'^)) satisfies for all x G R*^ and all y G R'^’” 

g{y){x) ■ X < Gi\\x\\^ + G 2 max \\yi\f + G 3 , 

2 = 1 ,...,m 

where the constants Gi,G 2 and G 3 are independent on x and y. 

(F) For each k = 1,... ,m, the function fk G C^(R'^™;R'^) satisfies for all y G R'^’” 

fk{y) -Vk^Fi max \\yif + F 2 , 

2 = 1 ,...,m 

where the constants Fi and F 2 are independent on y and k. 

(U) The set U C R^ is compact and convex. 

( 7 ) The function 7 : 2/ —)■ R is strictly convex. 


( 1 . 10 ) 


( 1 . 11 ) 





Remark 1.3. We briefly compare Hypotheses (H) with those of [5, 11]. In [5], which deals with an 
SDE-constrained optimal control problem, functionals with respect to state variables and the 

control are considered. Therefore our hypotheses are just slightly more restrictive. On the other 
hand, we do not require differentiability of the running cost. The authors of [11] deal, instead, with a 
mean-field game type optimality conditions to model evacuation scenarios. They derive a first-order 
condition under the hypotheses of continuous differentiability of the functionals with respect to the 
state variables together with convexity and positivity assumptions. Furthermore, they deal specifically 
with an control cost, while we allow ours to be strictly convex. 

We now give the rigorous definition of mean-field optimal eontrol. 

Definition 1.4. Let {y^,pP) G x VcO^'^) be given. An optimal control u* for Problem 1 with 
initial datum is a mean-field optimal control if there exists a sequence {uN)N&n C L^([0,T];ZY) 

and a sequence {y%)N&i £ T’c(M‘^) such that 

(f) for every A" G N, y%{-) '■= n) ^ sequence of empirical measures for some 

£ supp(//^) -|- H(0,1) such that weakly* in the sense of measures; 

(m) for every A G N, u*j^ is a solution of Problem 2 with initial datum {y^,y^); 

{Hi) there exists a subsequence of {uN)NeN converging weakly in L^([0,T];A) to u*. 

Remark 1.5. As mentioned before, the above definition is motivated by our interest in optimizers 
that are close to optimal controls for the original finite-dimensional problems. Notice also, that since 
the measures have all compact support contained in supp(;U°) -|- R(0,1), the build a compact 
sequence in Vp{W^) for all p > 1, and therefore, due to weak* convergence to yP, we also have that 

limAT^oo >Vp(/r5,r,/U°) = 0. 

2 The coupled ODE-PDE dynamics 

In this section, we first recall results for PDF equations of transport type with nonlocal interaction 
velocities, like the one appearing in the second equation of (1.2). We then study the coupled ODF-PDF 
dynamics (1.2) and we state existence and uniqueness results of solutions, together with continuous 
dependence on the initial data {y^, y^) and on the control u. The proofs follow closely in the footsteps 
of similar results in [3, 22, 31, 32]. We also show that finite-dimensional ODF dynamics (1.8) are 
embedded in (1.2), in the sense that the solution of (1.2) with an initial data that is an empirical 
measure coincides with the empirical measure with support on the solution of (1.8). 

2.1 Transport PDE equations with nonlocal interaction 

In this section, we study equations for the dynamics of measures, recalling results of existence and 
uniqueness. We first define the meaning of solution for the equation 

dty{t) = -Va: • {v{t, X, y{t))y{t)), (2.1) 

where v : [0,T] x x Pi(M'^) —)■ is a given vector field and n G N is the dimension of the 

underlying Euclidean space. 

Definition 2.1. We say that a map y : [0,T] ^ Pi(M"^) is a solution of (2.1) if the following holds: 
(f) y has uniformly compact support, i.e., there exists R> 0 such that supp(/r(-)) G B{0,R); 
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(a) fi is continuous with respect to the Wasserstein distance Wi; 

{in) satisfies (2.1) in the weak sense, i.e. (see [4, Equation (8.1.4)]), 

4:/ Hx)dfi{t){x) = [ V(j){x) ■ v{t,x,n{t))dfi{t){x), 

for every cj) G C“(M"';M). 

Now, we can formally define the concept of solution of the controlled ODE-PDE system (1.2), 
which applies, mutatis mutandis, to system (1.3) as well. 

Definition 2.2. Let u G L^{[0,T];l{) and {y^,iJ,^) G X, with of bounded support, be given. We 
say that a map {y,fi) : [0,T] —)■ Af is a solution of the system (1.2) with control u if 

(z) (y( 0 ),/i( 0 )) = (yO,MO); 

(if) the solution is continuous in time with respect to the metric (1-10) in X; 

(iff) the y coordinates define a Caratheodory solution of the following controlled ODE problem 
yk{t) = {K-k ii{t)){yk{t)) + fk{y{t)) + Bku{t), k = l,...,m, 

for all t G [0, T] ; 

(iv) fi is a solution of ( 2 . 1 ), where v : [ 0 ,T] x x Pi(M^) ^ is the time-varying vector field 
defined as follows 

v{t, X, y{t)){x) := {K -k y{t) + g{y{t))){x). 

We now derive the existence of solutions of (1.2) as limits for Ai —)• oo of the system of ODE (1.8). 
We first prove that solutions of (1.8) coincide with specific solutions of (1.2). We then prove the limit 
result with the help of Lemmata 2.4 and 2.5. 

Proposition 2.3. Let N be fixed, and the control u G L^{[{),T];hl) he given. Let {y,XN) ■ [0,r] —)■ X 
be the corresponding solution of (1.8), with XN{t) = {xi^N{t), ■ ■ ■ ,X]\f^N{t)). Then, the couple {y,iiN) '■ 
[0,T] —)■ with tiN{t) being the empirical measure 

1 ^ 

gN{t){x) := — ^(x - Xi,7v(t)), 
i=l 

is a solution of (1.2) with control u. 

Proof. It can be easily proved by rewriting (1.2) with and arguing exactly as in [23, Lemma 
4.3]. □ 

Lemma 2.4. Let iL ; —)■ satisfy (K) and g G Pi(R‘^). Then for all y G R'^ it holds 

\\{Kkg){y)\\ < Ck (l + \\y\\+ \\x\\dg{x)'^ . 

Proof. See, for instance, [23, Lemma 6.4]. □ 
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Lemma 2.5. Let K satisfy (K) and let : [0,T] —)■ : [0,T] —)■ Pi(M‘^) he 

two continuous maps with respect to Wi satisfying 

supp(/U^(t)) U supp(/i^(t)) C B{0,R), 

for every t G [0,T], for some R> 0. Then for every p> 0 there exists constant Lp^n such that 

\\K -k i/{t) - K -k p^{t)\\L<^^^B{0,p)) < Lp^RWi{i?{t),p^{t)) 

for every t G [0, T]. 

Proof. A proof of this result may be found, for instance, in [23, Lemma 6.7]. □ 

Proposition 2.6. Let G pP G VciJ^'^), and p%{-) be as in Definition 1.4--{i). Let {uN)N&i ^ 
L^([0, TpU) he a sequence of controls such that un u, for some u G L^([0, T];U). 

Then, the sequence of solutions {yN,hN) £ Lip([0, Tj; A) of (1.8) with initial data and 

control Un converges to a solution {y,p) G Lip([0,T]; A) of (1.2) with initial data {y^,pP) and control 
u. Moreover, there exists pT > 0, depending only on y^, supp{p^), K, g, fk, Bk,lL, and T, such that 
for every N gN , for every k = 1,... ,m and for every t G [0, T] it holds 


\\yk,N{t)\\,\\yk{t)\\ < PT and supp{pN{t)),snpp{p{t)) T B{0,pt). 


Proof. We start by fixing > 0 and estimating the growth of l|yfc,Af(i)|P + lki,Ar(i)|P for A: = 1,... ,m 
and i = 1,... A. Let S = {(/,j) : I = 1,... ,m and j = 1,... A}. From Hypotheses (H), Lemma 2.4 
and the compactness of 7/, it holds 


2 dt 


\yk,N\?‘ + |ki,Ar|P) = yk,N ■ yk,N + Xi^N ■ Xi^N 

= ((A* pN){yk,N) + fk{y) + Bku) ■ yk,N + ((A* pN)ixi) + g{y){xi^N)) ■ Xi^N 
< ||(A k pN){yk,N)\\ \\yk,N\\ + fk{yN) ■ yk,N + ||.Bfct(||||yfc^7v|| 

+ ||(A*^Ar)(Xi,Ar)||||Xi,Ar|| + g{yN)iXi,N) ' Xi^N 


< Ck 1 + \\yk,N 


+ Ck I 1 + \\xi. 


N 




:, 7 v||+Fi max \\yi^N\r + F 2 -\-Mi\\yk^N\ 


\xi^N\\+Gi\\xi^N\\ +G 2 max \\yi^N\\ + Gs 

Z=l,...m 


<771 max {\\ye,Nf + \\xj^N\f}+C 2 , 

(bi)es 


with Cl = 4Cii- + Ai + G 2 + Ml and C 2 = Ck + A 2 + G 3 + Mi. If we denote with b(^k,i){t) = 
||yfc,Af(A)|P + ||a^i,Af(A)|P and with a{t) = max(j j)gs{&(z,j)(A)} > then the Lipschitz continuity of a implies 
that a is a.e. differentiable, while by Stampacchia’s Lemma (see for instance [28, Chapter 2, Lemma 
A.4]) for a.e. t G [0,T] there exists a (Z,j) G S such that 

d{t) = ^ (||yz,Ar(t)||^ + \\xj,Nit)\\^) < 2Cia{t) + 2 C 2 . 

Hence, Gronwall’s Lemma and Definition 1.4-(i) imply that 

ait) < (a(0) + 2C2t)e2^i* < (Cq + 2 C 2 t)e^^^\ (2.2) 


11 



for some uniform constant Cq only depending on and supp(^®). It then follows that the trajectories 
(yAr(-), ^Ar(-)) are bounded uniformly in in a ball B{0,pt) C for 

PT := VCo + 2C2Te^^^, 

that is positive and does not depend on t or on N. This in turn implies that the trajectories 
{VNi'), fJ'Ni')) are uniformly Lipschitz continuous in N, as can be easily verified by computing ||yA:, 7 v|| 
and ||xi, 7 v|| and noticing that all the functions involved are bounded by Hypotheses (H) and the fact 
that we are inside B{0,pt)- Therefore 

\\yk,Nit)\\ < Pt , \\xi,Nit)\\ < Pt, (2.3) 

where the constant p'j, does not depend on t or on A'". 

By an application of the Ascoli-Arzela theorem for functions on [ 0 ,T] and values in the complete 
metric space X, there exists a subsequence, again denoted by {yNi'), yN{-)) converging uniformly 
to a limit p{-)), whose trajectories are also contained in B{0,pt)- Due to the equi-Lipschitz 

continuity of (2/7v(')) and the continuity of the Wasserstein distance, we thus obtain for some 

Lx > 0 


\\{y{t 2 ),Kt 2 )) - {y{ti),p{t 2 ))\\x= lim \\{yN{t 2 ),yN(t 2 )) - {yN{ti),PN(ti))\\x < - H|, 

Af^+oo 

for all ti,t 2 £ [0,2^]. Hence, the limit trajectory {y*{■), p*{■)) belongs as well to Lip([0, T]; A). 

It is now necessary to show that the limit {y{-), p{-)) is a solution of (1.2). We first verify that y is a 
solution of the ODEs part for y = /r. To this end, we observe that the limit (2/7v(')) I^A''(')) (?/(■)> I^(')) 

in A specifies into 

f VN ^y, in [0,T], 

Xi/N^y, inLi([0,r],R2'='). 

and 

lim Wi{pN{t),p{t)) =0, 

N^+oo 

uniformly with respect to t G [0,2"]. As a consequence of (2.4), (2.5), hypothesis (K), and Lemma 
2.5, for all k = 1,.. . ,m we have in [0,T] for N —)■ +oo 


(2.4) 

(2.5) 


{K-k pN){yk,N) ^ {K-kp){yk), 

fkivN) ^ fk{y)- 


To prove that y{t) is actually the Caratheodory solution of (1.8), we have only to show that for 
all k = 1 ,... ,m one has 

ijk = {Kk p){yk) + fk{y) +BkU. 

This is clearly equivalent to the following: for every r/ G and every £ G [0,T] it holds 

V- [ yk(t)dt = r]- [ [{K k p{t)){yk{t)) + fk{y{t)) + Bku{t))]dt, (2.7) 

Jo Jo 

which follows from ( 2 . 6 ) and from the weak -convergence of iik,N to yk and of u^! to u for 
N —)■ -|-oo. 

We are now left with verifying that y is a solution of (1.2) for y = y. For all t G [0,T] and for all 
(p G Ci(M'^;M) we infer that 


{(p.PNii) - Pn{^)) 



Vcpix) ■ [{KkpN){x) + g{yN){x)]dpN{t){x) 


dt, 
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which is verified by considering the differentiation 


dt 




i=l 


N 


(f){xi{t)) ■ Xi{t) 


L^=l 


and directly applying the substitution Xi = {K * ^N){xi) + g{y]\f){xi). By Lemma 2.5 and (2.5), we 
also have that for every p > 0 


lim \\K-k U]\f(t) 
N^+oo 




0 in [0,r], 


and, as (/> G has compact support, it follows that 


lim ||V(/> • {K-k pfq{t) - K k p{t))\\oo = 0 in [0,r]. 

Af^+oo 


Similarly, we have 

lim \\V(j)- (givNit)) -g{y{t)))\\oo = 0 in [0,T], 

Af^+oo 

by the compact support of cj), the -continuity of g and the uniform convergence of yN to y. 
Denote with the Lebesgue measure on the time interval [0,f|. Since the product measures 

£^L[q j'|xi/X 7 v(t) converge in Vi{[0,t\ x to £^L[q x, we finally get 


lim 

N^-\-oo 



Vcpix) ■ [K k pN{t)+g{yN{t)]{x)dpN{t){x)dt 



V<j){x) ■ [K k p{t) + g{y{t))]{x)dp{t){x)dt, 


that, together with (2.7), proves that (y,/u) is a solution of (1.8) with initial data {y^,pP) and control 
u. □ 


Corollary 2.7. Let y^ G , p,^ G Vc(R'^), and u G L^([0, T];7/). Then, there exists a solution of 
(1.2) with control u and initial datum {y^,p^). 

Proof. Follows from Proposition 2.6 by taking any sequence of empirical measures as in Definition 
1.4-(f), and the constant sequence uat = u for all G N. □ 

The following intermediate results shall be helpful in proving the continuous dependance on the 
initial data. 

Proposition 2.8. Let iF : —)■ and g : R*^™ —)■ C^(R'^;R'^) satisfy hypotheses (K) and (G). 

Then, for every R > 0, there exists L'^ > 0 satisfying L'^ < C'{1 + R) for some C" > 0, and 

\\{Kkp^){x^) - {Kkp^){x^)\\ < L'^iWiip\p^) + ||xi - x^ll), (2.8) 

for all x^,x‘^ G B{0,R) C R'^ and p^,p^ G Pi(R‘^) with supp(/U^), supp(^^) C B{0,R). 

Moreover, for every R > 0, there exists L/j > 0 satisfying Lr < C{1 + R) for some C > 0, and 

II {K k p^){x^) + g{y){x^) - {K k p^){x^) - g{y){x^)\\ < Lr{Wi{p^,p^) + ||x^ - x^H), (2.9) 

for all x^,x^ G B(0, i?) C R'^, y G B{0,R) C R'^™' and p^, p"^ G Pi(R‘^) with supp(/i^), supp(//^) C 
.6(0,7?). 
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Proof. By hypothesis, we have 


\\{K * - {K * H^)ix)\\ 


K{x — x') d{fi^ 





and 

\\{K-k fi^){x^) - {K-k^^){x^)\\ < f \\K{x^ - x) - Kix"^ - x)|| dn^{x) < Lip2^(i^)||x^ - x^||, 

JR" 

where Lip2j:j(it') stands for the Lipschitz constant of K on B{0,2R). Since from (K) it follows 

LiP2_R(-^) ^ 2C'ii-(l + R), 

this proves (2.8) for L'^ := \jIy>2r{K) and C := Ck ■ Moreover, there exists G {tx^ + (1 “ '■ t £ 

[0,1]} such that 

\\g{y){x^) - g{y){x‘^)\\ < sup \\I)yg{q){^)\\\\x^ - x^\\ < M\\x^ - x^\\, 

^eB(o,iJ)cR‘*,feB(o,iJ)cR‘*’" 

for some M > 0, from the regularity of g. It then suffices to observe that, for some C > 0, it holds 


Up2r{K) + M < 2Ck{1 + R) + M <C{1 + R). 

This proves (2.9) for Lr = Lip2j:j(ii') + M. □ 

The estimate in Proposition 2.8 shows that the following general result holds for vector fields of 
the form v{t,x, ;= {K -k + g{y(t))){x), since from Proposition 2.6 follows that x,y and g, lie 
in domains with a priori known bounds. 

Proposition 2.9. Let v,w : [0,T] x x Pi(R'^) —)■ R'^ be vector fields that satisfy the following 
hypotheses: 

1. V and w are measurable with respect to t; 

2. for every i? > 0 there exists Lr satisfying Lr < C{1 + R) such that for all G Pi(R^) 

with support in B{0,R) and all x^,x^ G R^ it holds 

\\v{t,x\g^) -v{t,x‘^,g‘^)\\ < LR{Wiig^, + \\x^ - x^\\), 

\\w{t,x^,g^) - w{t,x'^,g^)\\ < LR{Wi{g^, g^) + ||x^ - x^||). 


Moreover, given G , assume that there exist two corresponding solutions g^,g^ of 

(2.1) with vector fields v,w, respectively, and final time T. Then there exist constants Ci and C 2 
such that 

Wi{g^{t),g‘^{t)) < e^^^Wi{g^’^,g^’d + [ C 2 e^^^ sup ||u(s,x,^^(s)) - u;(s,x,^^(s))|| ds, (2.11) 

Jo xGB{0,R) 

where Ci and C 2 depend on the final time T , on the radius R and Lr the Lipschitz constant in 

( 2 . 10 ). 

Proof. See proofs in [22, Lemma 6.5, Lemma 6.6, Theorem 6.8]. □ 

We now prove the continuous dependence on the initial data, that also gives uniqueness of the 
solution for (1.2). 
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Proposition 2.10. Let the Hypotheses (H) hold. Let u G L^([0,r],ZY) he given, and take two solu¬ 
tions and {y‘^,p?) of (1.2) with control u and with initial data {y^’^, G X, 

respectively, where and have both compact support. Then there exists a constant Ct > 0 
such that 

- iy'^it),L'^it))\\x < CrWiy^'^, p°'^) - for all t G [0,r] 

Proof. We start by noticing that, by the definition of a solution, we infer the existence of a > 0 
for which y^(-),y^(-) G B{0,pt) C and supp(/x^(-)),supp(^^(-)) C B{0,pt) C 

We shall show the continuous dependence estimate by chaining the stability of the ODE 

i/kit) = {K-k p{t)){yk{t)) + fk{y{t)) + Bku{t), k = l,...,m, (2.12) 

with the one of the PDE 


dthit) = -Va; • [{K -k p{t) + g{y{t)))p{t)] , (2.13) 

first addressing the dependence of (2.12). By integration we have 

Wylit) -yk(.t)\\ <\\yk^ -Vk^W 

+ [ {\\{K-*^pHs)){yl{s))-{Kkp‘^{.s)){yl{s))\\ + \\fk{y^{s))-fk{y^{s))\\) ds. 

Jo 

(2.14) 

Eor the sake of notation, we shall denote with 


F= inax Lip (/fe), 

G= sup ||Dy5r(?)(0||. 

^eB(o,pT)cR‘*,<reB(o,pT)cR‘*™ 

Eor the left-hand side of (2.14), (2.8), the -regularity of for A: = 1,..., m, and the uniform bound 
on y^{-) and y‘^{-) yield 

Wylit) - yl{t)\\ < \\yk^-yk^\\+ (2.i5) 

+ [ + ^PTllyfc(«) “yfc(^))ll+ ^lly^(«) 

Jo 

We now consider (2.13). Define the vector fields 

v^{t,x,p) := {K k p + g{y^{f))){x), v‘^{t,x,p) := {K k p + g{y^{t))){x), 

and let —)■ M be a cutoff function on B{0,pt) with ||V<h|| < 1 and compact support in 

R*^. Observe that, since ||y^(-)ll) Il2/^(')ll ^ Pt and supp(^^(-)),supp(^^(-)) C B{0,pt), then p^ and 
p'^ also solve (2.1) with and in place of and respectively. It then follows easily from 
Proposition 2.8 that Proposition 2.9 holds for v = <I>u^ and w = Hence, from (2.11) and taking 
into account that v^ = and in B{0,pt), we have 

Wi{p^{t),p‘^{t)) < e^^*Wi{p°’^, p°’^)-^ [ sup \\v^{s,x, p^{s)) - v‘^{s,x, p‘^{s))\\ ds, 

Jo x&B{0,pt) 

for some constants Ci and C 2 . By (2.9) and the regularity of g, for every s G [0,T] we have 
||u^(s,x,^^(s)) - u^(s,x,^^(s))|| < Lp.^Wi{p^{.s), p‘^{.s)) + G\\y^{.s) - y^{s)\\. 
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This gives 


+ / C' 2 e'^i* (Lpj,>Vi(;U^(s),;U^(s))+ G|| 2 /^(s)-/( s)||) ds. (2.16) 
Jo 

We now consider the function 

e{t) := \\{y\t),^,\t))-{y\t),^^\t))\U 
and, combining (2.15) for each k = 1,... ,m and (2.16), we obtain 

£(t) <||y°’^-2/°’^|| + [ {L'p^Wi{fi^{s),fi^{s)) + L'pJy^{s)-y^{s))\\+mF\\y^{s)-y^{s)\\) ds 
Jo 

+ e'^i*>Vi(/U°’\^°’^) + [ 026 ^^''{Lp^Wi{fi^{s),fi^{s)) + G\\y^{s)-y‘^{s)\\) ds 
Jo 

<£(0)e^^* + f {L'p^ + mF + {Lp^, + G)C* 2 e^^'^)e(s) ds. 

Jo 

Gronwall’s lemma then implies 

£{t) < e(0)e^i‘ {^L'p^ + mF)t + ^ 

Since t G [0, T], the result is proved. □ 

Remark 2.11. Going back to the application of the Ascoli-Arzela Theorem in Proposition 2.6, con¬ 
sider another converging subsequence of {yN,yN)- We can prove that its limit is another solution of 
(1.8). Since the solution is unique for Proposition 2.10, we have that all converging subsequences of 
{yN,dN) have the same limit, hence the sequence {y]\f,fiN) has itself limit {y*,fi*). 

Remark 2.12. Since equicompactly supported solutions are unique, given the initial datum, by 
Proposition 2.10, combined with Proposition 2.6 we infer that the support of the unique solution can 
be estimated as a function of the data, namely it is contained in a ball B{0,pt), where the constant 
is depending only on y^,supp{pP), K, g, fk, and T. 

2.2 Existence and constrnction of solutions of Problem 1 

In this section, we prove that Problem 1 admits a solution which is a mean-field optimal control. The 
proof generalizes similar results in [22]. 

We first recall the main definition of T-convergence. We then define the sequence of functionals 
(EAf)AreN related to Problem 2 and F related to Problem 1 and prove that (TAr)ArgN P-converge to 

F. 

Definition 2.13 (P-convergence). [20, Definition 4.1, Proposition 8.1] Let X be a metrizable sep¬ 
arable space and Fat: X —)■ (— 00 , 00 ], iV G N be a sequence of functionals. We say that {F^)m£M 
P -converges to F, written as F^r —> F, for a given F: X —)■ (— 00 , 00 ], if 

1. liminf-condition.' For every n G X and every sequence um u, 

Fiu) < liminf Fn(un); 

N ^+00 
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2. lim sup-condition; For every u £ X, there exists a sequence uj^ —^ u, called recovery sequence, 
such that 


F{u) > limsupF;v(n7v)- 

W—)-+oo 


Furthermore, we call the sequence equi-coercive if for every c G M there is a compact 

set K C X such that {u : F]\f{u) < c} C K for all G N. As a direct consequence of equi-coercivity, 
assuming G argmin^Ar / 0 for all Ai G N, there is a subsequence {u*j^^)k£n and u* G X such 
that 

-£■ u* £ arg min F. 

In view of the definition of F-convergence, let us fix as our domain X = L^([0,T];ZY) which, 
endowed with the weak -topology, is actually a metrizable space. 

Fix now an initial datum {y^,fjP) £ X, with fjP compactly supported, and choose a sequence 
as in Definition 1.4-(i). 

Consider the functional F{u) on X defined in (1.1), where the pair {y,fi) defines the unique 
solution of (1.2) with initial datum {y^,pP) and control u. Similarly, consider the functional Fn{u) 
on X defined in (1.7), where the pair (yAr,^Ar) defines the unique solution of (1.2) with initial datum 
{y^,^^) and control u. As recalled in Proposition 2.6, such solution coincides with the solution of 
the ODE system (1.8). 

The rest of this section is devoted to the proof of the F-convergence of the sequence of functionals 
{FN)NeN on X to the target functional F. Let us mention that F-convergence in optimal control 
problems has been already considered, see for instance [ 12 ], but, to our knowledge, it has been only 
recently specified in connection to mean-field limits in [22, 23]. 

Theorem 2.14. Let the functionals (1.1)-(1.7) and dynamics (1.2) satisfy Hypotheses (H). Consider 
an initial datum {y^,pP) £ x Vi{W^), and a sequence where is as in Definition 

1.4-{i). Then the sequence of functionals (FAr)jveN on X = L^{[t),T]-,U) defined in (1.7) T-converges 
to the functional F defined in (1.1). 

Proof. Let us start by showing the F — liminf condition. Let us fix a weakly convergent sequence of 
controls un ^ u* in A. We associate to each of these controls a sequence of solutions {yN,LN) of 
(1.2) uniformly convergent to a solution {y*,p,*) with control u* and initial datum (y°,^°). In view 
of the fact that solutions {yN,hN) and {y*, p*) will have uniformly bounded supports with respect to 
N and t £ [0,T] and by the uniform convergence of trajectories yN{t) ^ as well as the uniform 
convergence , h*(t)) —)■ 0 for f G [0, T], it follows from the continuity of L under Hypotheses 

(H) that 

lim [ L{yN{t),p.N{t))= ( L{y* (t), p* {t))dt. (2.17) 

N^ + OoJq Jq 

By the assumed weak convergence of (tt 7 v)AreN to u* £ X and Ioffe’s Theorem (see, for instance, [2, 
Theorem 5.8]) we obtain the lower-semicontinuity of 7 

liminf [ j{uN{t))dt > [ j{u*{t))dt. (2-18) 

N^-Coo Jq Jq 

By combining (2.17) and (2.18), we immediately obtain the F — liminf condition 


liminf Fn{un) > F{u*). 

N^-\-oo 
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We now prove the T — limsup condition. We now fix u* and consider the trivial recovery sequence 
un = u* for all N G N. Similarly as above for the argument of the T — liminf condition, we can 
associate to each of these controls a sequence of solutions (vNit), fJ'Ni't)) of (1.2) uniformly convergent 
to a solution (y* (t), fj,* (t)) with control u* and initial datum and we can similarly conclude 

the limit (2.17). Additionally, since {un)n&'N is a constant sequence, we have 

liminf [ j{uN{t))dt = [ j{u*{t))dt. (2-19) 

N^+OO Jq Jq 

Hence, combining (2.17) and (2.19) we can easily infer 


limsup Fn{un) = lim Fn{u*) = F{u*). 

N^oo N^oo 


□ 

Corollary 2.15. Let the Hypotheses (H) in Seetion 1.1 hold. For every initial datum {y^,yP) G 
^dm ^ there exists a mean-field optimal control u* for Problem 1. 

Proof. Consider empirical measures as in Definition 1.4-(i). Notice that the optimal controls 
of Problem 2 belong to X = L^([0, T];7/), which is a compact set with respect to the weak topology 
of . Hence, the sequence (pAr)ArgN is equicoercive, and (wJ^)AreN admits a subsequence, which we 
do not relabel, weakly convergent to some u* £ X. 

We can associate to each of these controls and initial data {y^,y^) a solution (yAr,^Ar) of 
(1.2). The sequence of solutions (yAr,/ijv) is then uniformly convergent to a solution {y*,ij,*) of (1.2) 
with control u* , by Proposition 2.6. In order to conclude that u* is an optimal control for Problem 
1 (and hence, by construction, that u* is a mean-field optimal control) we need to show that it is 
actually a minimizer of F. For that we use the fact that F is the P-limit of the sequence (FAr)jvgf^ as 
proved in Theorem 2.14. Let rt G X be an arbitrary control and let (iiAr)iVeN be a recovery sequence 
given by the P — lim sup condition, so that 

F{u) > limsupFjv(W at)- (2.20) 

N^-\-oo 

By using now the optimality of we have 

limsup Fat( iijv) > limsup Fat(mJ^) > liminf Fat(hJ^). (2 -21) 

Af^+oo Af^>+oo N^-Coo 

Applying the T — lim inf condition yields 

liminf Fiv(M^) > F{u*). (2.22) 

N—>--eoo 

By chaining the inequalities (2.20)-(2.21)-(2.22) we have 

F{u) > F{u*), for all u £ X, 


i.e., that u* is an optimal control. □ 

Remark 2.16. Observe that the previous result does not state uniqueness of the optimal control for 
the infinite dimensional problem. Indeed, in general, we cannot ensure that all solutions of Problem 
1 are mean-held optimal controls. 
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3 The finite-dimensional problem 


In this section we study the discrete Problem 2 and state the PMP for it. We first recall the following 
existence result for the optimal control problem. 

Proposition 3.1 (Theorem 23.11, [15]). Under Hypotheses (H), Problem 2 admits solutions. 

We now introduce the adjoint variables of Xi and yt, denoted by pi and qu , respectively, and 
state the PMP in the following box. 


Theorem 3.2 (Theorem 22.2, [15]). Let he a solution of Problem 2 with initial datum 
(y(0),x(0)) = (y^,x^), and denote with {y* {■), x* {■)) : [0,T] — the corresponding trajec¬ 
tory. Then there exists a Lipschitz curve {y*{■), q*{■), x*{■),p*{■)) G Lip([0, T], solving 

the system 


r, = Vg,MNiy*,q*,x*,p*,u*) 
ql = -Vy,MN{y*,q*,x*,p*,u*) 

< x*=VpMN{y*,q*,x*,p*,u*) ( 3 . 1 ) 

p* = -V,MN{y*,q*,x*,p*,u*) ’•••’ ’ 

u*N = aigmaxB.Niy* ,q* ,x* ,p* ,u), 

, u£U 

with initial datum {y{0),x{0)) = (y^,x^) and terminal datum {q{T),p(T)) = 0, where the Hamil¬ 
tonian Hat : ]^ 2 dm+ 2 dA'' given by 


MN{y,q,x,p,u) = 


>*• 

1 \ i=i 


- xy) + giy){x,) \ + 


m / ^ N \ 

+ Xj) + fk{y) + Bku\ - L{y,pN) - 7 {u), 


with pn = Jr - xt) 


Remark 3.3. The general statement of the PMP contains both normal and abnormal minimizers. 
In our case, the simpler formulation of the PMP is given by the fact that we have normal minimizers 
only. This is a consequence of the fact that the final configuration is free, see e.g. [15, Corollary 22.3]. 

Remark 3.4. The uniqueness of the maximizer of Hjv follows from the same motivations reported 
in Remark 1.2. Indeed, the form of the Hamiltonian implies that for each u* it holds 

( m 

/ Qk ■ BkU — 7(tt) 
k=l 

In other terms, since the control acts on the y variables only, then we have a simpler formulation for 
the maximization of the Hamiltonian Hat . 

We now want to embed solutions of the PMP for Problem 2 as solutions of the extended PMP for 
Problem 1. As a first step, we prove that pairs control-trajectories (tt]^, (y^, x]^,p^)) satisfying 

system (3.1) have support uniformly bounded in time and in G N. 
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Proposition 3.5. Let G G , O'nd be as in Definition 1.4-{i)- Let be a 

solution of Problem 2 with initial datum and let (y^, be a pair control- 

trajectory satisfying the PMP for Problem 2 with initial datum and control u*j^ given by 

Theorem 3.2. 

Then the trajectories (y^(-)) 9Ar(')) ^Ar(')) ? where ufj- := , are equibounded and equi- 

Lipschitz continuous from [0,T] to y, where the space y := x is endowed with the 

distance 


ll(y,9,i^) - {y',<l',v')\\y = ||y-y'|| + \\q-q'\\ +]yi{n,y). (3.3) 

Furthermore, there exists Rt > 0, depending only on y^,supp{y^),d,K,g, fk,Bk,lL, and T, such that 
supp(i/)(r(-)) T B{0,Rt) for all N gN. In particular, itholdsW{ylj,qlj,vlj,u),j) = Wc{y*N-,yN-,'^N-,u). 4 )- 

Proof. As a first step, notice that the pair solves the system (1.8). It then follows from 

(2.2) and (2.3) that there exist two constants px and pfi, not depending on N such that, for all 
i = 1,..., A", for all k = 1,... ,m, and a.e. t G [0,T] we have 

||yfc,Ar(t)|| < PT , IKiv(t)|| < PT (3.4) 

||2/fc,Ar(^)|| < Pt ) \\^i,Ni'^)\\ — Pt ■ (3-5) 

It follows in particular that there exists a uniform constant Wt such that 


1 ^ 


i=l 


< Wt 


for all t G [0, T]. 

We now observe that by an explicit computation 


NV,MM,qN,x*N,P*N,u*^) • ez =N 


r* 1 r* 1 

Y, D^«iV - 4iv)ez - <N)ei 


N N 


i=i 


i=i 


m 

+ • Vy(y^)(x,V) -NY ' ^D^(4,iv - <iv)ez 


k=l 


- (y)v, <iv, w Ef=i ^(4,iv)) • ez 

+ F EjLi ^( 4 a)) • ez^ 

1 ^ 

''fif “ 4a) ■ (^■^(4zv “ 4 a)^ 0 + 4Ar • Da;y(yXr)(4Ar) 

i=i 

m 

- Y qIn • (D^(4,iv - 4iv)e0 - {y%, ^ Ef=i ‘^(4,7v)) ' ^z 

fc=i 

- (V,£ (y^, ^ Ef=i ^(4,zv)) D^(47 v)) • ez, 

(3.6) 
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for each i = 1,...,iV and each I = 1,... ,d (where we have used that DK is even and we merged the 
first two terms). Therefore, since p *solves (3.1) and r*= Np*^, we get 


-rlnit) ■ ei ^«Ar(i) - r’*jv(t)) • (DK{xlj^{t) - x*^(t))eO 
i=i 

m 

+ K,Nit) ■ (Dyg{yNit))ixlN{t))ei) ■ (DA'(yfe,jv(t) - <7v(t))e0 

k=l 

- (yArW>< 7 v(i), w Da;(x*^(t))) • e;, 

for each i = 1,..., N, each / = 1,..., d, and a.e. t G [0, T], where we have used the fact that Y)K is 
even. We now denote with Lt a uniform constant such that 


\\'^^\\L^{B{ 0 ,pT),M.d'Xd') ^ Lt , sup ||Dyfl'(y)(-)llL°°(B( 0 ,pT),®‘*^‘*) — 

\y\<\GnRT 

L°°(B(0,VmpT)xB(0,pT)xS(0,lVT),K‘*) — , \\'^‘;^\\l°°{B(0,^pt)xB(0,pt)xB(0,Wt),^‘^) — : 

l|Dw||ioo(^(0,py),Rdxd) < Lt , 

and we easily get the estimate 


' LN 


< VdL'T I 21 


i,N 


1 


N 


i=i 


k=l 


+ 1 + Lt 


(3.7) 


for each i = 1,..., and a.e. t G [0, T]. An explicit computation of Vpj,]HlAr and a similar argument, 
possibly with another constant Lt , show the estimate 


\Qk,N 


/ N N 

< VdLx ( — ^ lki,Ar(^)ll + 2||g'fc^7v(^)ll + T'T 

\ i=l / 


for each A: = 1,..., m and a.e. t G [0, T]. We now set 


(3.8) 


N 


^N{t) ■■= WqIn 

k=l 






and observe that it holds 


i=l 


N 


|eiv(i)| < ll9fc,Af(^)ll + ^ Y^ W'^In 


k=l 


2 = 1 


therefore (3.7) and (3.8) yield 


l£Ar(2)| < VdLx (4ejv(A) + 1 + “^Lt) 


(3.9) 


Defining then the increasing functions pwit) through pN{t) ■= sup.,-g[o,t] ^n{T — t) , and observing that 
it holds pn{^) = 0 for the boundary conditions in Theorem 3.2, from (3.9) and Gronwall’s Lemma we 
obtain 

Pn{t) < Vdlrril + 
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and with this 


(3.10) 


eN{t) < m{T) < VdLrTil + ;= Ct 

for all t G [0,T]. Plugging into (3.9), we get the existence of a constant C'rp such that 

|£A''(^)| ^ c'rp (3.11) 

for a.e. t G [0, T]. Since by definition of and standard properties of the Wasserstein distance 

Wi it holds 

/ N N 

Wi{u*ip{t + T-),VN{t)) < V2 + t) - xlN{t)\\ + j;^'^\\rlN{t + T) -r*jv(t))|| 

i=l ^ i=l 

from the previous inequality, (3.4), (3.5), (3.9), (3.10), and (3.11) we obtain that y*j^(t) and g^(t) are 
equibounded, that there exist a constant, denoted by Rt, such that snpp{u]^{t)) C B{0,Rt) for all 
t G [0,T] and that equi-Lipschitz continuous from [0, T] with values in 3^. □ 

Proposition 3.6. Let iV G N and u*j^ G U’{[0,T];l/() be an optimal control for Problem 2 with 
given initial datum (?/^,x^) G and G Lip([0,T], a 

eorresponding trajeetory of the PMP with maximized Hamiltonian Htv • 

Define (^1 j'p"! TV 5***5 ^Tv N 5 LN TV ^ wzth 11 TV Tis zn ( 1 . 11 ) j and assume that supp (zz^y (*)) 

B{0,Rt)- Then, the control is optimal for Problem 1 and {y*j^, , u*j^) satisfies the extended 

Pontryagin Maximum Principle. 

Proof. First observe that, by Proposition 3.5, ]HIc( 2 /^, ttj^) = B[(y^, ( 7 ^, tt^) and that for 

every t G [0, T] 



u*^{t) = avgm.axMN{yN{t),q*j,f{t),x*^{t),p%{t),u) u*^{t) = argmaxH(y^(t), g;^(t), ^')(r(t), u), 

u&A uGU 

due to the specific form of the Hamiltonian Htv and H, see Remark 3.4. 

We now prove that 


^fc,TV ~ qk^N {Vn 1 ^*N 1 P*N ^'Cn) 1 

9k,N — ~'^yk^N{yN^9N^^*N^P*N’^*N)^ 


yk,N — 9^5'^TV5 ^?^)5 

91,N — Vk^iVN^ 9 n^ ^’’^n)’ 


i.e., that if the {y,q) variables satisfy the PMP for Problem 2 then they satisfy the extended PMP 
for Problem 1. It is sufficient to observe that Htv can be rewritten in terms of k'ff(-) as follows 


■mr / * * * * \ 

^N[yN,9N,^N,UN) 


r ■ {K -k (x) dvf[ (x, r) + 


r-g{y*N){x) du*^{x,r) 


m « 

+ IZ / 9l,N-K{yk,N-x)dv*jq{x,r) 


m 

+ X/ 

k=l 


- L{y*N, TTl^V*^) - -i{uy), 


Uk{yk,N) + BkU*j^) 


where we used the variable r = Np. Comparing it with H(t/^, tt^), one has that their 

expressions coincide up to the first term. Since such first term is independent on y^ and q^, then 
Vyj,HTv = Vyj,H and Vgj,HTv = Vgj,H, hence equations for y^ipAkN ™ PMP for Problem 2 and 
in the extended PMP for Problem 1 coincide. 
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We now prove a similar result for the variables, with = Np*j^. After this change 

of variable, the third and the fourth equation in (3.1) become 


i^i,N 


— N\7{Un7 Qn7 

— “AiV2;^]Hl7v(y^, q'j ^7 x*j^,p*j^, u*j^) . 


We want to prove that the following identity holds 


J(Vi,]HIc(2 /^, q*j^, u*^))ix*^^, r*^j7f ) 


N'Vj.i^Niy%-7QN'’^*N’P*N^'^*N) \ 

—N'Vxi^Niy'j\f7QN,x*j^,p*j^,u*j^) y’ 


(3.12) 


i.e., that the Hamiltonian vector helds generated by HI and Hat coincide in each point {x* , r* . 
The presence of the constant N in the right-hand side is due to the change of variables r* = Np*^^. 
By applying J~^ on both sides of (3.12), we need to prove 


VA{y*N7qN7^N7uy){xl^7rlN) • ez 
'^u^ciyN7 ^*Ni^*N){Pi,N^ ''"In) ■ 


NVxiMNiyN^QN^^N^PN^^N) ■for I = 1,... ,d, (3.13) 

NVrMN{ylf,qlf,x*^,p*j7^7U*^) ■ ei_d for I = d+1,... ,2d{3.U) 


By writing explicitly the left hand sides of (3.13) and (3.14) by using the expressions (1.5)-(1.6) and 
evaluating them in {x* n 7 r* , we have 


VMyN7 q*N7 '^*N7 «^)«7vT- iv) • ez = ^ - r*,^) • (Di^(x* at - xl^)ei) + • (D,5(y^)«zv)ez) 

j=i 

m 

- Y, qlN • (Di^(y^,zv - <7v)ez) - (y^, ^ Ef=i ^(^In)) ' ^z 

Zi:=l 

- (v,£ {y*^,xl^, i Ef=i^(4zv)) D^«iv)) • ez, 


for / = 1,..., d, so that (3.13) follows immediately from (3.6). Similarly, we have 


V;/IHIc(y^, q'^, z/)(r, u^)(x*^, r*^) • ez 


1 ^ 

— Y - ^In) 
1=1 


ez-cz + 9{yN)i^i,N) 


ei-d 


for / = d -|- 1,... ,2d, which coincides with the right hand side of (3.14) by an explicit computation. 
Since the boundary conditions of Problem 2 and Problem 1 coincide too, after the identification 
■= nAr(x)| zV’Pi N’ ■ ■ ■ N’Pn n)^ result follows now by (3.13)-(3.14) arguing, for instance, as 
in [23, Lemma 4.3]. □ 


Remark 3.7. It is interesting to observe that the embedding of a trajectory of the PMP for Problem 
2 to the empirical measure formulation depends on the number N of agents, see the definition of 
Htv in (1.11). This is a consequence of the fact that the Hamiltonian of the PMP for Problem 2 
actually depends on the number of agents. Indeed, consider a population composed of a unique agent 
(xi,pi), for which the first term of the Hamiltonian reads as pi ■ g{y){xi). Consider now a population 
composed of two agents (xi,pi, X 2 ,P 2 ) satisfying xi = X 2 and pi = P 2 7 for which the hrst term of the 
Hamiltonian reads as 2pi • y(y)(xi). 

Clearly, in both cases the empirical measure in the state variables is pi = p 2 = <5(x — xi), 
while the dehnition of Hzv gives two different empirical measures for the cotangent bundle: vi = 
5{x — xi,r — pi) and V 2 = \5{x — xi,r — 2pi). This difference is needed to compensate the dependence 
of the Hamiltonian of the PMP for Problem 2 on the number N of agents. 
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4 The Wasserstein gradient 


We anticipated in Section 1 that the dynamics of i'* in (1.3) is an Hamiltonian flow in the Wasserstein 
space of probability measures, in the sense of [3]. This means that the vector field is an 

element with minimal norm in the Frechet sub differential at the point i'* of the maximized Hamiltonian 
He introduced in Theorem 1.1 (we drop for simplicity the y, q and u dependency). The proof of this 
fact shall follow the strategy adopted to obtain analogous results in [4, Chapter 10], which however 
cannot be applied verbatim to our case due to the peculiar nature of our operators. In order to use 
those techniques, we consider our functionals defined on instead than on Pi(M^'^). Since we 

have already proven in Proposition 3.5 that, whenever we start from a compactly supported initial 

datum, the dynamics remains compactly supported uniformly in time, this assumption does not alter 
our conclusions. 

We start with some basic definitions and general results on functionals defined on V 2 (K^'^): the 
following one is motivated by Definition 10.3.1 and Remark 10.3.3 in [4]. 

Definition 4.1. Let V’ • ^’ 2 ( 1 ^^'^) (~oo, + 00 ] be a proper and lower semicontinuous functional, and 

let vq G D(V'). We say that w G belongs to the (Frechet) subdifferential of V’ at vq, in 

symbols w G d'4){vQ) if and only if for any vi G 7^2(1^^*^) h holds 


V’(j^i) - V'(zzo) > inf , 


^^.’(^o) • {zi - zo)dp{zo, Zi) + o{W2{izi,no)). 


Proposition 4.2 ([4], Theorem 10.3.10). Fix the functional V’ : V 2 (^‘^^) (— 00 ,+ 00 ]. Then, for 

every vq G the metric slope 


satisfies 


{dipKno) = limsup 

ui^uo 


- V^(zzo))+ 

>V2(j^i,r'o) 


\d^p\{no) < \\w\\l2 ^ (4.1) 

for every w G 9V’(^o) • 

The following property shall guarantee that the subdifferential of He is nonempty. 


Definition 4.3. A proper, lower semicontinuous functional i) : 7 ^ 2 ( 1 ^"') ^ (— 00 ,+ 00 ] is semiconvex 
along geodesics whenever, for every G 7 ^ 2 ( 1 ^"') and p G To{'izq,'izi) there exists C G M for which 

it holds 


V’(((l - s)7ri + s'K 2 )#p) < (1 - s)V'(t'o) + + Cs{l - s)>V|(r'o, z^i)) for every s G [0,1]. 

In what follows, we shall fix y,q G and u G L^([0,T];77) and we write, for the sake of 

compactness, Hc(zz) in place of M.c{y,q,v,u). Moreover, K, shall denote a convex, compact subset of 
R^'^ and z = (x,r) a variable in R^*^. 

Whenever supp(zz) C B(0,Rt), Hc(jz) can be rewritten as 

Bic(i^) =[ F{z - z')dv{z)v{z') + [ Q{z)dv{z)— [ £{7ri{z), Juj7ri#u)dh'{z) + Q, 

Z Juid jRd J^d 

where 


F{x, r) = r ■ K{x) 

m 

Q{x,r) = r ■ g{y){x) + • F[{yk - x), 

k=l 
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and Q collects all the remaining terms not depending on u. Notice that J- is an even function. 

In order to prove the semiconvexity of Etc, we shall establish the semiconvexity of the following 
functionals: 

= ■^ [ P{z - z')dv{z)v{z') + f Q{z)dv{z), 
i{z,jLou)du{z), 

where T, G, i, and cD are functions. The desired result will then follow by noticing that = 

+'^{v) ioT F = F, G = G, i = —io ( tti , Id), lo = oj o m and 1C = B{0, Rt) ■ 

The following simple property will be needed to prove semiconvexity of the above functionals. 

Lemma 4.4. Let G with support contained in 1C. Let p G T{uq,ui) and set 

Vs = ((1 - s)7ri + s-K2 )#P, (4.2) 

for every s G [0,1]. Then, it holds 

supp(i/s) C 1C for all s G [0,1]. 

Proof. We first notice, that for every p G T{vq,vi) it holds 

supp(/3) QIC X 1C . (4.3) 


This follows from the equality 

M^'^\(/C X /C) = (M^"* X (m2'^\/C)) U ((m2'^\/C) X R^"*) 

and from the fact that both R^*^ x (R^‘^\/C)) and (R^‘^\/C) x R^'^ are p-null sets by hypothesis. 

To prove the Lemma, it suffices to show that for all / G C(R^^) satisfying / = 0 on /C it holds 


Indeed, 


/ fdvs = 0 . 

jR2d 

fdus = / fd{{l - s)7ri + STi 2 )#p{zo,zi) 

= / f{{l-s)zQ + szi)dp{zQ,zi) 


(4.4) 


iKxK 


/((I - s)zo + szi)dp{zQ,zi), 


since, by (4.3), supp(p) Q 1C x 1C. From the convexity of fC follows that (1 — s)zo + szi G K, for every 
s G [0,1], which, together with the assumption / = 0 in /C, yield (4.4), as desired. □ 


In what follows, we shall make use of the following, well-known result. 

Remark 4.5. Let /C be a convex, compact subset of R^'^ and let / G C^(R^'^;R). Then there exists 
Cicj G R depending only on K. and / such that 

/((I - s)xo + sxi) < (1 - s)f{xo) + sf{xi) + Ck:js{1 - s)||xo - xi|p, (4.5) 

for every Xo,xi G R^'^ and s G [0,1]. 
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We now prove the semiconvexity of . 

Lemma 4.6. Let vqjI'i G Vc{^'^) and let p G Then, there exists C G M independent of vq 

and ni for which 

]HIc(((l - s)vri + s'K2)#p) < (1 - s)IHIc(^o) + 'SEIc(^i) + Cs{l - s)>V|(z/o, ni) 

holds for every s G [0,1]. 

Proof. We may assume supp(r'o), supp(z/i) C /C for some convex and compact set /C C otherwise 
the inequality is trivial. Hence, from Lemma 4.4, it follows supp(i^ 5 ) C /C for every s G [0,1]. But 
then, since J- and Q are both the result follows as in [4, Proposition 9.3.2, Proposition 9.3.5]. □ 

Corollary 4.7. Let u! G G Vc0^‘^‘^), p G T{vq,i>i) and define Ug as in (4.2) for 

s G [0,1]. If we set 

f,s= (Ldug, (4.6) 

then 

116 - (1 - s)6 - s6|| < Cs{l - s)Wi{uo,ui), 
for all s G [0,1], where C is independent of uq and ni. 

Proof. Follows from Lemma 4.6 applied first to the functions 7^ = 0 and Q = to, and then to = 0 
and G = —uj. □ 

The semiconvexity of E!^ will be deduced as a corollary of the following estimate. 

Lemma 4.8. Suppose that i G x M'^;R), let zo,zi G /C and set Zg = {1 — s).2o + szi for all 

s G [0,1]. Furthermore, let G Pc(R^'^)j p G r(i^o,*^i) and define Vg and as in (4.2) and (4.6) 
for s G [0,1]. Then, for all s G [0,1], it holds 

i{Zs,^s) < (1 - s)6^o,6) + •562:1,6) + - s)>V|(r'o, J^l) + 

for some constant depending only on IC,i and u. 

Proof. Since K, is compact, Zg & K. for all s G [0,1]. Moreover, (1 — s)6 + ^6 G K,' for all s G [0,1], 
for some convex and compact set K,' C R'^. Notice that from (4.5) follows 

i{zg, (1 - s)6 + s6) < (1 - s)6^o, 6) + 6) + Ck.,k.'s{ 1 - s) ([[zo - 2 i||^ + 116 - 611 ^) , ( 4 - 7 ) 

and from the definition of 6 Jensen’s inequality, we get 

116 - 611 ^ < LiPyc(‘^)>Vi6i^o,i^i) < Lipyc(a;)>V|(t'o,i^i)- 

Moreover, for every s G [0,1] it holds 

1162s, 6) - 62s, (1 - s)6 + .s6) II < Lip^xyc' 116 - (1 - s)6 - s6 II 

< LipK;xiC'5(l - .s)C'>V|(r'o, J^i). 

Hence, for every s G [0,1], using (4.7), (4.8) and (4.9), we get 

62s, 6) = 62s, 6) - 62s, (1 - .s)6 + s6) +62s, (1 - . 5)6 + .s6) 

< (1 - 5)620,6) + 5621,6) + “ s)>V26t'0,l^l) + Cf ^ l ^^ s{l - s)||2o - 21 f. 

□ 


(4.8) 

(4.9) 


26 



Corollary 4.9. Let uq, ui G <ind p G Lo{vq, ui). Then, there exists C G M independent of vq 

and ni for which 


]HIc(((l - s)7ri + s'K2)#p) < (1 - s)EIc(^o) + + Cs{l - s)>V|(z/o, ^i) 

holds for every s G [0,1]. 

Proof. Notice that, by Lemma 4.4, H‘l{us) can be rewritten as 

^li^s)=[ iizs,Cs)dpizo,zi), 

JkxK. 

Furthermore, since p G To{nQ,ni) it holds 

/ \\zo-zi\\‘^dp{zQ,zi) = j \\zq- zi\\‘^dp{zQ,zi) = Wl{vQ,vi), 

JlCxK 

the thesis follows from Lemma 4.8. 


□ 


Proposition 4.10. The functional He is semiconvex along geodesics. 


Proof. Follows directly from Lemma 4.6 and Corollary 4.9, by noticing that Hc(j^) 
for .F = G = G, £ = —£o (tti. Id), w = cu o TTi and 1C = B{0, Rt) . 


wl{u)+nl{u) 

□ 


We define the vector field VuC. : —)■ as 


VX{z) 


TTi{z), fwTTi^v) + V,j£(y, 7ri(2:), /a;7ri#i/)Da;(7ri(z)) 
0 


for every z G This notation is reminiscent of the fact that this vector field will eventually turn 
out to be the 2-Wasserstein gradient of the functional L, as it will follow from Theorem 4.12 in the 
case = ^ = 0. We can thus define our candidate vector field for the Wasserstein gradient V;yHc(i^o) 
in the case that supp(r'o) CL B{0,Rt). 


w := (VJ^) -k V + VG — VjyF. 


(4.10) 


Notice that, by Hypotheses (H), re is a continuous function in z, and hence it is well-defined iz-a.e.. 

Lemma 4.11. Let v G ■ Then w defined by (4.10) belongs to L])(M^'^) for every p G [l,+oo], 

and it satisfies 


/ w{zo) ■ {zi - zo)dp{zo, zi) 


{VF{zo - Z 2 ) + yG{zo) - VyC{zo)) ■ {zi - zo)dp{zo, zi)dv{z 2 ) 

(4.11) 


for every plan p G F(i/, v') such that u' G VcO^‘^'^) • 

Proof. Since w is continuous, the fact that w is L])-integrable follows the fact that n has compact sup¬ 
port. Equation (4.11) then follows by Fubini-Tonelli and from the fact that p is compactly supported 
too by Remark 4.4. □ 

Theorem 4.12. Let v G P 2 (H^'^) be such that supp(j^) C B{0,Rt). Then v G Il(|(9Hc|) if and only 
if w as in (4.10) belongs to L^(M^'^). In this case, ||rc ||£,2 = |clHc|(z^), i.e., w is an element with 
minimal norm in dM.c{v). 
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From the last inequality, the assumption that < +oo and using the trivial estimate 

W-2{{Id + si)#v,v) < s ||^|| l 2 , 

we get 

/ w{zo) ■i{zo)dv{zo) < \dB.c\{u)\\^\\L2 

jR2d 

and hence, changing the sign of 

/ w{zo) ■ ^{zo)du{zo) < \dB.c\{u)\\^\\L 2 

jR2d 

This proves that w G and that ||w^||l2 < \dMc\{i^). 

We now prove that if the vector w belongs to , then it is in the subdifferential of He; this 

shall imply, by (4.1), that w G Il(|clHc|) and that it is a minimal selection dMc{i'), by the previous 
estimate and Proposition 4.2. 

We thus consider a test measure T, a plan p G To{v-,V) , and we compute the directional derivative 
of He along the direction induced by p. Denoting with 

£i(s) = - s)zo +szi, jojd{{l - s)t:i +st:2)#p), 

^2(5) = ^{y,ZQ, jujd{{l- s)tti +37:2)#p), 
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for every s G [0,1], then the map 

_ T{{1 - s){zo-zo) + s{zi-zi)) - J='{zo-zo) 

S I —r - 

S 

^ g((l - s)zo + szi) - g{zo) _ £i(g) - £2(3) _ £2(5) - A( 0 ) 

s s s 

as s —)■ 0 converges to 


VT{zo - zo) ■ {{zi - zq) - {zi - zo)) + {VG{zo) - V„£{zo)) ■ {zi - zq). 


Hence, from Proposition 4.10, the dominated convergence theorem, the identity (4.11) and since 
is odd, we get 

H,(p)-H,(0 > li,,, «■.(((! - »)’ri + 

S 


i / VTizo - Zo) ■ {{zi - Zo) - (zi - Zo)) dp{zo, zi)dp{zo, zi) 
4 jRSd 

+ [ (ve(zo) - V^C{zo)) ■ (zi - zo)dp{zo,zi) + o(>V2(P, iz)) 


= / w{zo) ■ (zi - zo)dp{zo,zi) + o{W 2 iiz,i')). 

We have thus proven that w G dM.c{v). 


□ 


5 Proof of Theorem 1.1 

In this section, we prove Theorem 1.1. We first recall that we already proved in Corollary 2.15 that 
there exists a mean-field optimal control for Problem 1. We now want to prove that all mean-field 
optimal controls are solutions of the extended PMP. 

Let u* be a mean-field optimal control for Problem 1 with initial datum {y^,pP). Fix p,^ as in 
Definition 1.4-(i), and consider a sequence (n^)ArgN of optimal controls of Problem 2 with initial 
datum {y^,p!^), having a subsequence (which, for simplicity, we do not relabel) weakly converging 
to u* in L^([0,r];^^). Denote with (y^,x^) the trajectory of (1.8) corresponding to the control 
u*j^ and the initial datum (y°,/i^) of Problem 2. Compute the corresponding pair control-trajectory 
[y]^, , x*j^, p*j^)) satisfying the PMP for Problem 2, that exists due to Theorem 3.2. Set := 

^n{xn,P*n) and r*^ := Np*j^f. By Proposition 3.5, the trajectories (y^, r'jv) are equibounded and 
equi-Lipschitz from [0, T] to the product space y = x Vi (M^'^) endowed with the distance eqrefe- 
Y, and the empirical measures iz'^ have equibounded support. Moreover, the pair (uj^, iy*j^, q]^, iz]^)) 
satisfies the extended PMP by Proposition 3.6. 

By the Ascoli-Arzela theorem, we have that there exists a subsequence, which we denote again 
with {y*j^, q*p^, iz'^), that converges to {y*,q*,iz*) : [0,T] —)■ x Pi(R 4'^) uniformly with respect to 

t G [0,T]. Since by definition tti^iz"^ = p*j^, by the convergence of to p* proved in Proposition 
2.6, we get 'Ki^iz* = p* . Observe that {y*,q*, iz*) is a Lipschitz function with respect to time and iz* 
has support contained in B{0,Rt) for all t G [0,T]. Moreover, by the boundary conditions for each 
N, we have that y*(0) = y^, 7ri^(iz*(0)) = p^ and q*{T) = 0, 7r2^(n*(T))(r) = 6{r). 

Fix now t G [0, T]. To shorten notation, let E: R*^™ x R^ —)■ R be the functional, strictly concave 
with respect to u , defined as 

m 

E{q, u) = '^qk- BkU - -f{u ). 

k=l 
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Recall that by (3.1) and by Remark 3.4, u*j^{t) satisfies 

= aigmaxE{q}f{t),u), 

u£U 

since the maximum is uniquely determined by strict concavity. Since U is bounded, by definition 
E{-,u) is continuous uniformly with respect to u GU. The convergence of to q*{t) then implies 

that every accumulation point vt GlA of must satisfy 

vt = aig max E{q* {t),u) (5-1) 

new 

and is therefore uniquely determined. This shows that the sequence u*j^ is pointwise converging in 
[0,T] to the function v{t) := vt- Due to the boundedness oi U, we further have that u*j^ —)■ u in 
L^((0,T);Z^). Since u*j^ was already converging to u* weakly in L^((0,T);i^) it must be u*{t) = v{t) 
for a.e. t G (0,T), which together with (5.1) implies that 

—)• u* strongly in L^((0,T);ZY) (5-2) 


and that 


u*{t) = argmax E{q*{t),u) 
ueu 


for a.e. t G [0, T]. Due to the explicit expression of ]HI(y, q, u, u) in (1.4), this is equivalent to say that 


m{y*{t),q*{t),u*{t),u*{t)) 


arg max M.{y* (t), q* (t), u* {t), u) 
new 


for a.e. t G [0, T]. 

We finally prove that {y*,q*,v*) satisfies the Hamiltonian system (1.3) with control u*. Due to 
equi-Lipschitz continuity, we have that the derivatives {y*j^,q%), and converge to {y*,q*), and 

dtv*, respectively, weakly in L^([0, T]; and in the sense of distributions. Observe now that by 

(1.5) and (1.6) the vector held Vi,lHlc(y, q, w)(-, •), which is independent of u, is continuously depending 
on {y,q,u). By the uniform convergence of {y^^, q]^, and since supp(j^)(r(f)) C B{0,Rt) for all 
t G [0, T] we get that 

V:,Mc{y*j^{t), q*N{t), r) ^ V^M.c{y*{t), q*{t), u*{t)){x,r) 

uniformly with respect to t G [0,T] and (x,r) G B(0,Rt). From this, using again the narrow 
convergence of to w*(t) and since supp(r')(f(f)) C B{0,Rt), we then get the uniform bound 

II (TV,ylHic(y^(t),g^(t), r')(f(t))) w)(r(f)||^j_(]R-D,KU) < C'r, 

for some constant Ct independent of t G [0,T], as well as the narrow convergence 

(JV,He( 2 /^(t), ( 7 ^(t), - {JVMy*{t),q*{t), u*{t))) u*{t) 

for all t G [0, T]. Testing with functions (j) £ C?^([0, T] x R), the two above properties are enough 
to show that 

in the sense of distributions, so that ly* solves the third equation in (1.3). 

For all k = 1,... ,m, taking derivatives in the explicit expression in (1.4) and using the dehnition 
of He, we have that Vy^,Mc{y,q,i^,u) is actually independent of u and is continuous with respect to 
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the Euclidean convergence on (y, q) and the narrow convergence on measures v with compact support 
in a fixed ball Therefore, since , q%, converges to {y*,q*,i'*) uniformly with respect 

to f € [0, T ], and there is no dependence on u, for all /c = 1,..., m we have that 

in uniformly with respect to f G [0,T]. It then follows that q* solves the second equation in (1.3). 
A similar argument, also using the convergence of to u* proved in (5.2), shows that 

VgfcHc(2/Ar(t),^Ar(f),^')(r(t),^^;v(^)) qk^dv* {t), q* (t), u* (t), u* {t}) 

in L^([0,T];M'^) for all k = 1,... ,m, so that y* solves the first equation in (1.3). This concludes the 
proof of Theorem 1.1. 


6 An example 

In this section, we show the application of the extended Pontryagin Maximum Principle to a toy 
model for crowd interactions. The Cucker-Smale model, introduced in [18], was first studied in its 
mean-field limit form in [24]. It models the phenomenon of alignment of velocities in crowds, that can 
be observed, e.g., in flocks of birds. 

In this model, each agent is identihed by its position Xi and velocity Vi , and it adjusts its velocity 
by relaxing it towards a weighted mean of the velocities of the group. The weight is a nonincreasing 
function (p of the distance between individuals. In the original paper [18], the authors propose (p{X) = 
^ some fixed parameters K,a > 0 and /3 > 0. For our computations, we consider (/) G 

C^(M'^,M''') being a radial function. 

The finite-dimensional dynamics is given by the ODE system 


' Xi = Vi, 

1 ^ 




We add to it m leaders with positions and velocities given by {yk-,Wk) for every k = l,...,m, on 
which a control variable Uk is active. Since the control acts as an external force, Uk will directly affect 
the evolution of the velocities Wk only. The mean-field limit for N —)■ -|-oo of the resulting system is 
given by (see, e.g., [22]) 


Vk = Wk, 


Wk = fi){yk,Wk) + — '^4’{yk -yj)iwj - Wk) + Uk, 


i=i 


= -V ■ 


^-ky + — '^<P{x-yj){i 
m 


i=i 


where y = y{x,v) is the density of followers and <h(x,u) := (j){x){—v). 


case of (1.2), where the state variables for the leaders are yk ■= 


yk 

Wk 


k = 1,... ,m 

( 6 . 1 ) 

y , 

Notice, this is a particular 
and y = (yi,... ,ym), the 
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ones for the followers are x 


^ , and one chooses 


X(x) := 


0 

^>1x1 


5(y)(x) = 


E™ 1 -m))’ 




and, for every k = 1,... ,m, is the 2d x (dm) matrix that maps u = 


Ul 


Uri 


E into the 


element 


0 

Uk 


G . Notice that since (/> is a radial function, the function <h, and thus K , is odd. 


A standard problem in the study of the Cucker-Smale model is to find conditions to ensure flocking, 
i.e., alignment of the whole crowd to the same velocity. For this reason, it is interesting in our case to 
study the minimization of the variance^ of the crowd, by choosing 


f f 2 \ 1 r 

L{y,d)-=\ — ll^fclp + 2||u|M - —'^Wk+ vdfi{x,': 


I -E 

/«2. ^ 


2 + 2||u|P- 


;^E^^ + J v'dfi{x',v’)^ • + dfi{x,v), 


( 6 . 2 ) 


that is of the form L = Jjg2d^(y,x, J lo/j,) diJ,(x) by choosing w(x) = v and 


^(y,x,?) 


m 


'^\\wkf + 2\\vf 


k=l 


(1 E- 

— > W'fc + 

\ m 

\ k=l 




For the control constraints, we assume lA := [—1,and we choose to penalize the L^-norm of the 
control, hence 7 (u) := ||tt|p. 

Remark 6.1. Other forms for the cost L can be of interest. For example, on may want to drive the 
crowd to a given fixed velocity v. In this case, one can minimize 


Li{y,n) 



^ m 


-s||2 + 


\v — u 


k=l 


dfi{x, v), 


that is again of the form ^(y, x, f ujgi) dn{x ), with i not depending on its third variable, this time. 

Since Hypotheses (H) are clearly satisfied, we now apply the extended Pontryagin Maximum 
Principle to the optimal control problem with cost functional (6.2) constrained by the system (6.1). 
For simplicity of notation, we study the 1-dimensional problem, i.e., d = 1. We introduce the dual 
variables of and x denoted by = {qk,Zk) and r = (r, s), respectively. The Hamiltonian H in 

^For simplicity of computation, we consider minimization of 4 times the variance. 
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(1.4) can be found by direct substitution; 


1 


EI(y, q, i/, tt) = 7: / {s — s')(f){x — x'){v'— v) du{x',v',r', s') du{x,v,r, s) 

2 Jr8 

f / I ^ \ 

+ / I ru + s— ''^^ 4 >{x — yj){wj — v) \ di'{x,v,r,s) 


i=i 


+ ( QkWk + Zk (t){yk - x){v -Wk)di/{x,v,r, s) 

V 


+ ^ I Zk— ^ (p{yk - yj){wj - Wk) + ZkUk 


k=l 


i=i 


- i{y,x,Ju!y)dy{x.)-\u\ 


The optimal control can be explicitly computed by (3.2) as follows 

4(y,q) = 0^" 

I sign( 2 ;fc) otherwise. 

Denoting with y the first marginal of , the PMP dynamics of the state and adjoint variables is given 
by 


Vk = Wk, 

Wk = {^*y){yk,Wk) + ^ Er=i - yj)i'^j - Vk)+ ul{y,q), 

ik= ^ /r 4 s(f)'{x - yk){wk - v) dv{x, V, r, s) - Zk /r4 (p'iyk - x){v - Wk) dv{x, v, r, s) 

^ Ej^k Zj(t>'{yk - yj){wj - Wk), 

^k= - /r 4 (^s - Zk) (t>{x - yk) du{x, V, r, s)-qk + Ej^k Zj4>{yk - yj) 

-^Wk + ^ EJ=i Wj + ^Ir 2V dy{x, v), 

dtv = -y[x,v,r,s) ■ {{JV,,Mc{y,C[,U,U*))u) , 

where the components of the vector field V,y]HIc(y, q, i^, u*) are given at every point {x, v, r, s) G by 


ViMc-vi= / is — s')4>'ix — x'){v'— v) dvix',v',r',s') + s — 4>'ix — yj){wj — v) 

./h 4 m ^ 


j=i 


y~l Zk(l)'{yk -x){v - Wk), 


k=l 


Vi^BIc • 62 = - / {s- s')(j){x - x') du{x',v', r', s') + r - s— (j){x - yj) + Zk4>{yk - x) - 3v 

./b 4 m ^ 


i=i 


k=l 


H- '^Wk+ / vdy{x,v), 

m f-' ./ro 2 


k=l 


• 63 = u, 


VJ^c-ei = (^*/u)(x,u) + — - yj){wj - v). 


j=i 
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We remark that, as it happens for the standard PMP, the explicit computation of the third and 
the fourth components gives exactly the vector field determining the dynamics of fj ,, the first marginal 
of z/, in accordance with (6.1). 
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